NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Data Driven CI System Design and Procurement with Open XDMoD

https://doi.org/10.1145/3708035.3736000

Furlani, Thomas R; Jones, Matthew D; White, Joseph P (July 2025, ACM)

The ability to apply data-driven design principles to customize new CI investment to best serve the intended community as well as provide fact-based justification for its need is critical given the important role it plays in research and economic development and its high cost. Here we describe a data driven approach to CI sys- tem design based on workload analyses obtained using the popular open-source CI management tool Open XDMoD, and how it was leveraged in a procurement to provide end-users with an additional 5.6 million CPU hours annually, with subsequent procurements following similar design goals. In addition to system design, we demonstrate Open XDMoD’s utility in providing fact-based justifi- cation for the CI procurement through usage metrics of existing CI resources.
more » « less
Free, publicly-accessible full text available July 18, 2026
Predictive Modeling of HPC Job Queue Times: Improving User Decision-Making and Resource Utilization

https://doi.org/10.1145/3708035.3736067

Gaikwad, Bipin; Simakov, Nikolay A; Furlani, Thomas; White, Joseph Patrick; Patra, Abani (July 2025, ACM)

This work presents a framework for estimating job wait times in High-Performance Computing (HPC) scheduling queues, leverag- ing historical job scheduling data and real-time system metrics. Using machine learning techniques, specifically Random Forest and Multi-Layer Perceptron (MLP) models, we demonstrate high accuracy in predicting wait times, achieving 94.2% reliability within a 10-minute error margin. The framework incorporates key fea- tures such as requested resources, queue occupancy, and system utilization, with ablation studies revealing the significance of these features. Additionally, the framework offers users wait time esti- mates for different resource configurations, enabling them to select optimal resources, reduce delays, and accelerate computational workloads. Our approach provides valuable insights for both users and administrators to optimize job scheduling, contributing to more efficient resource management and faster time to scientific results.
more » « less
Free, publicly-accessible full text available July 18, 2026
Overview of ACCESS allocated cyberinfrastructure usage

https://doi.org/10.1145/3626203.3670521

White, Joseph Patrick; Weeden, Aaron; Deleon, Robert; Furlani, Thomas; Jones, Matthew D (July 2024, ACM)

ACCESS is a program established and funded by the National Sci- ence Foundation to help researchers and educators use the NSF na- tional advanced computing systems and services. Here we present an analysis of the usage of ACCESS allocated cyberinfrastructure over the first 16 months of the ACCESS program, September 2022 through December 2023. For historical context, we include analyses of ACCESS and XSEDE, its NSF funded predecessor, for the ten-year period from January 2014 through December 2023. The analyses in- clude batch compute resource usage, cloud resource usage, science gateways, allocations, and users.
more » « less
Full Text Available
The Data Analytics Framework for XDMoD

https://doi.org/10.1007/s42979-024-02789-2

Weeden, Aaron; White, Joseph P; DeLeon, Robert L; Rathsam, Ryan; Simakov, Nikolay A; Saeli, Conner; Furlani, Thomas R (June 2024, SN Computer Science)

Full Text Available
First Impressions of the NVIDIA Grace CPU Superchip and NVIDIA Grace Hopper Superchip for Scientific Workloads

https://doi.org/10.1145/3636480.3637097

Simakov, Nikolay A.; Jones, Matthew D.; Furlani, Thomas R.; Siegmann, Eva; Harrison, Robert J. (January 2024, ACM)

The engineering samples of the NVIDIA Grace CPU Superchip and NVIDIA Grace Hopper Superchips were tested using different benchmarks and scientific applications. The benchmarks include HPCC and HPCG. The real application-based benchmark includes AI-Benchmark-Alpha (a TensorFlow benchmark), Gromacs, OpenFOAM, and ROMS. The performance was compared to multiple Intel, AMD, ARM CPUs and several x86 with NVIDIA GPU systems. A brief energy efficiency estimate was performed based on TDP values. We found that in HPCC benchmark tests, the per-core performance of Grace is similar to or faster than AMD Milan cores, and the high core count often allows NVIDIA Grace CPU Superchip to have per-node performance similar to Intel Sapphire Rapids with High Bandwidth Memory: slower in matrix multiplication (by 17%) and FFT (by 6%), faster in Linpack (by 9%)). In scientific applications, the NVIDIA Grace CPU Superchip performance is slower by 6% to 18% in Gromacs, faster by 7% in OpenFOAM, and right between HBM and DDR modes of Intel Sapphire Rapids in ROMS. The combined CPU-GPU performance in Gromacs is significantly faster (by 20% to 117% faster) than any tested x86-NVIDIA GPU system. Overall, the new NVIDIA Grace Hopper Superchip and NVIDIA Grace CPU Superchip Superchip are high-performance and most likely energy-efficient solutions for HPC centers.
more » « less
ACCESS: Advancing Innovation: NSF’s Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support

https://doi.org/10.1145/3569951.3597559

Boerner, Timothy J.; Deems, Stephen; Furlani, Thomas R.; Knuth, Shelley L.; Towns, John (August 2023, PEARC '23: Practice and Experience in Advanced Research Computing)

Full Text Available
Are we ready for broader adoption of ARM in the HPC community: Performance and Energy Efficiency Analysis of Benchmarks and Applications Executed on High-End ARM Systems

https://doi.org/10.1145/3581576.3581618

Simakov, Nikolay A.; Deleon, Robert L.; White, Joseph P.; Jones, Matthew D.; Furlani, Thomas R.; Siegmann, Eva; Harrison, Robert J. (February 2023, Proceedings of the HPC Asia 2023 Workshops (HPC Asia '23 Workshops))

Full Text Available
Performance Optimization of the Open XDMoD Datawarehouse

https://doi.org/10.1145/3491418.3530290

Dean, Gregary; Moraes, Joshua; White, Joseph; Deleon, Robert; Jones, Matthew; Furlani, Thomas (July 2022, Proceedings of the Practice and Experience in Advanced Research Computing, ser PEARC '22)

Full Text Available
Monitoring and Analysis of Power Consumption on HPC Clusters using XDMoD

https://doi.org/10.1145/3311790.3396624

White, Joseph P.; Innus, Martins; Deleon, Robert L.; Jones, Matthew D.; Furlani, Thomas R. (July 2020, Proceedings of the Practice and Experience in Advanced Research Computing, ser PEARC '20)

Full Text Available
Towards Performant Workflows, Monitoring and Measuring

https://doi.org/10.1109/ICCCN49398.2020.9209647

Sperhac, Jeanette; DeLeon, Robert L.; White, Joseph P.; Jones, Matthew; Bruno, Andrew E.; Ivey, Renette Jones; Furlani, Thomas R.; Bard, Jonathan E.; Chaudhary, Vipin (August 2020, Proceedings of the 29th International Conference on Computer Communications and Networks, ser IEEE ICCCN '20)

Full Text Available

« Prev Next »

Search for: All records